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Abstract 

Various  types  and  techniques  of  on-line  approximation 
have  been  used  in  feedback  control  of  uncertain  non¬ 
linear  systems.  In  many  practical  applications,  satu¬ 
ration  of  the  control  input  influences  significantly  the 
performance  of  adaptive  and  learning  control  systems. 
This  article  addresses  the  issue  of  control  input  satu¬ 
ration  in  on-line  approximation  based  control  of  non¬ 
linear  systems.  A  modified  control  design  framework 
is  presented  for  preventing  the  presence  of  input  sat¬ 
uration  from  destroying  the  learning  capabilities  and 
memory  of  an  on-line  approximator  in  feedback  con¬ 
trol  systems.  The  stability  properties  of  the  proposed 
feedback  control  law  are  obtained  via  Lyapunov  analy¬ 
sis.  Particular  emphasis  is  given  to  aircraft  longitudinal 
control,  which  extends  the  results  to  the  backstepping 
feedback  control  procedure. 

1  Introduction 

A  variety  of  feedback  control  approaches  have  been 
developed  to  deal  with  nonlinear  systems,  including 
feedback  linearization  [5] ,  sliding  mode  control  [9] ,  and 
backstepping  [11].  In  their  ideal  form,  both  feedback 
linearization  and  backstepping  rely  on  cancellation  of 
known  nonlinearities.  To  address  the  issue  of  uncer¬ 
tainty,  several  “robustifying”  techniques  have  been  de¬ 
veloped:  (i)  adaptive  methods  deal  with  parametric  un¬ 
certainty  [7],  where  the  nonlinearities  are  assumed  to 
be  known  but  some  of  the  parameters  that  multiply 
these  nonlinearities  are  unknown  or  uncertain;  (ii)  ro¬ 
bust  methods  [2]  deal  with  the  case  where  known  upper 
bounds  on  the  unknown  nonlinearities  are  available  and 
therefore,  they  tend  to  be  conservative,  sometimes  lead¬ 
ing  to  high-gain  feedback;  (iii)  robust  adaptive  methods 
[14]  combine  parametric  uncertainty  and  unknown  non- 
linearities  with  partially  known  bounds. 

The  above  control  techniques  are  based  on  the  assump¬ 
tion  that  the  plant  nonlinearities  are  either  known  or 
can  be  bounded  by  some  known  functions.  In  many 
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applications,  including  control  of  high  performance  air¬ 
craft  and  uninhabited  air  vehicles  (UAVs),  some  of  the 
nonlinearities  need  to  be  approximated  on-line.  This 
may  be  due  to  modeling  errors  during  the  identifi¬ 
cation/modeling  phase  or,  quite  often,  due  to  time- 
variations  in  the  dynamics  as  a  result  of  changes  in 
the  operating  conditions  or  due  to  component  wear  or 
battle  damage.  To  address  the  issue  of  unknown  non- 
linearities,  various  control  system  architectures  have  in¬ 
corporated  various  network  models  as  on-line  approxi¬ 
mators  of  unknown  nonlinearities. 

The  application  of  on-line  approximation  methods  to 
nonlinear  systems  in  a  feedback  framework  yields  a 
complex  nonlinear  closed-loop  system,  which  is  ana¬ 
lyzed  using  Lyapunov  stability  methods.  Typically,  the 
feedback  control  law  and  the  adaptive  law  for  updat¬ 
ing  the  network  weights  are  derived  by  utilizing  a  Lya¬ 
punov  function,  whose  time  derivative  is  forced  to  have 
some  desirable  stability  properties  (for  example,  nega¬ 
tive  semi-definiteness).  Therefore,  the  stability  of  the 
closed-loop  system  is  obtained  during  the  synthesis  of 
the  adaptive  control  laws.  Examples  of  this  type  of 
approach,  which  is  referred  to  as  Lyapunov  synthesis 
method,  include  [3,  4,  10,  13,  15,  16]. 

From  a  practical  perspective,  one  of  the  key  problems 
in  feedback  control  systems  is  that  the  signal  u(t)  gen¬ 
erated  by  the  control  law  cannot  be  implemented  due 
to  physical  constraints.  A  common  example  of  such 
constraint  is  input  saturation,  which  imposes  limita¬ 
tions  on  the  magnitude  of  the  control  input.  In  some 
applications  this  problem  is  crucial,  especially  in  com¬ 
bination  with  nonlinear  on-line  approximation  based 
control,  which  tends  to  be  aggressive  in  seeking  the 
desired  tracking  performance.  In  aircraft  control  ap¬ 
plications,  input  saturation  is  caused  by  limitations  in 
control  surface  deflections.  For  UAVs,  the  absence  of 
humans  in  the  air  vehicle  may  allow  more  aggressive 
maneuvering,  however  the  feedback  control  law  has  to 
deal  both  with  unknown  nonlinearities  and  input  sat¬ 
uration. 
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Input  saturation  in  an  adaptive  linear  control  frame¬ 
work  has  been  addressed  in,  for  example,  [1,  6,  8,  12]. 
One  possible  approach  is  to  completely  stop  adapta¬ 
tion  during  saturation  of  the  control  input.  While  this 
ad-hoc  method  does  prevent  the  tracking  error  induced 
by  actuator  constraints  from  corrupting  parameter  es¬ 
timation,  the  stability  properties  of  the  closed-loop  sys¬ 
tem  cannot  be  established.  Another  approach  that  has 
been  proposed,  which  we  refer  to  as  training  signal 
hedging  (TSH),  see  e.g.  [1,  8],  modifies  the  tracking 
error  definition  used  in  the  parameter  update  laws.  Fi¬ 
nally,  a  third  approach,  referred  to  as  pseudo-control 
hedging  (PCH),  alters  the  commanded  input  to  the 
loop  [6,  12].  The  idea  behind  the  PCH  approach  is 
to  attenuate  the  command  to  the  loop  so  that  the  gen¬ 
erated  control  signal  is  implementable  without  satura¬ 
tion. 

This  article  addresses  the  issue  of  control  input  satura¬ 
tion  in  on-line  approximation  based  control  systems.  A 
modified  control  design  framework  is  presented  for  pre¬ 
venting  the  presence  of  input  saturation  from  destroy¬ 
ing  the  learning  capabilities  and  memory  of  an  on-line 
approximator  in  feedback  control  systems.  The  design 
method  is  based  on  the  TSH  approach.  The  stability 
properties  of  the  proposed  feedback  control  law  are  ob¬ 
tained  via  Lyapunov  analysis.  Two  design  schemes  are 
presented:  the  first  is  based  on  a  direct  learning  control 
scheme  and  the  second  is  based  on  an  indirect  learning 
control  scheme.  Particular  emphasis  is  given  to  aircraft 
longitudinal  control,  which  extends  the  results  to  the 
backstepping  feedback  control  procedure. 

2  Problem  Formulation 

To  facilitate  a  more  intuitive  understanding  of  the  is¬ 
sues  of  input  saturation  in  adaptive  and  learning  con¬ 
trol  systems,  we  start  with  a  simple  scalar  system. 
Later  on,  the  design  and  analysis  techniques  are  ex¬ 
tended  to  a  class  of  higher-order  systems.  Consider  the 
scalar  nonlinear  system 

x  =  fo(x)  +  f(x)  +  g(x)u  (1) 

where  x  €  1Z  is  the  measured  output,  /o  is  a  known 
function,  f{x)  and  g(x)  are  unknown  nonlinear  func¬ 
tions  to  be  approximated  on-line,  and  u  is  the  control 
signal.  Suppose  that  the  control  objective  is  for  x(t)  to 
track  xr(t),  which  is  the  output  of  a  reference  model 

xr  =  —axr  +  f3r ,  (2) 

where  a  >  0,  /3  are  known  parameters  and  r{t)  is  a  mea¬ 
surable  command  signal.  Using  on-line  approximation 
methods,  f(x,9f)  =  9j(j>f( x)  and  g(x,0g)  =  Oj  (j)g(x ) 
are  used  as  approximators1  of  fix)  and  g(x)  respec¬ 
tively,  where  9f,  9g  are  the  adjustable  parameters  of 

1  For  simplicity,  in  this  article  we  use  linearly  parametrized 
approximators.  The  case  of  nonlinearly  parametrized  approxi¬ 
mators  can  also  be  considered  by  appropriate  handling  of  the 
higher-order  terms  [15]. 


the  on-line  approximator  in  vector  form,  and  <f>f,  (j>g, 
are  the  coresponding  regressors  or  basis  functions.  Us¬ 
ing  standard  techniques  from  adaptive  and  on-line  ap¬ 
proximation  based  control,  a  feedback  control  law  of 
the  form 

u  =  1.  \-f0(x)  -  f(x,0f)  -  ax  +  fir]  (3) 

9{X,0g)  1  J 

Of  =  T f(x  -  xr)<j>f(x)  (4) 

9g  =  Tg{x  ~  Xr)(j)g{x)u  (5) 

can  be  designed  to  achieve  the  desired  tracking  objec¬ 
tive  (where  Tj,  Tg  are  positive  definite  matrices  repre¬ 
senting  the  learning  rate  of  the  on-line  approximation) . 
The  stability  properties  of  the  above  feedback  control 
law  can  be  derived  under  certain  conditions.  Specif¬ 
ically,  if:  (i)  the  Minimum  Functional  Approximation 
Error  (MFAE)  between  f(x),  g{  x)  and  f{x,  9f),  g(x,  9g) 
respectively  is  zero;  and  (ii)  the  update  law  (5)  is  ap¬ 
propriately  modified  to  guarantee  that  g(x,  9g)  does  not 
approach  zero  (to  avoid  stabilizability  problems),  then 
it  can  be  shown  using  Lyapunov  analysis  that  all  the 
variables  remain  bounded  and  the  tracking  error  con¬ 
verges  to  zero  (see,  e.g.,  [4,  13,  15,  16]). 

In  many  applications  (including  aircraft  systems),  the 
control  law  described  by  (3)  may  not  always  be  real¬ 
izable  due  to  actuator  constraints  such  as  saturation. 
Assume  that  the  control  input  u  is  constrained  by  the 
following  known,  constant  upper/lower  limit  bounds: 
ul  <  u{t)  <  ujj .  In  a  more  general  setting,  the  limit 
bounds  ul,  ujj  may  be  functions  of  the  state  x. 

Due  to  actuator  saturation,  the  actual  feedback  learn¬ 
ing  control  law  being  implemented  is  different  from  (3), 
(4),  (5)  as  follows: 

u  =  sat  (u0,uL,uu)  (6) 

u0  =  1.  ~fo{x)  -  f{x, Of)  -ax  +  (3r  (7) 

9{x,9g)  L  J 

Of  =  T f(x  -  xr)<j>f(x)  (8) 

9g  =  Tg{x  -  xr)(j)g{x)u  (9) 

where  the  saturation  function  “sat”  is  linear  with  unity 
slope  between  its  lower  and  upper  limits;  i.e., 


r  ul 

if 

u0  <  uL 

sat [uo,  U L,  Uu)  =  l  Uq 

if 

ul  <  u0  <  uv 

{  Uu 

if 

u0  >  Uu 

Even  though  the  problem  formulation  has  been  sim¬ 
plified  to  illustrate  the  key  issues  more  clearly,  it  can 
be  readily  shown  that  the  modified  control  law  (6)-(9), 
with  saturation  limits,  cannot  guarantee  the  stability  of 
the  closed-loop  system.  In  practice,  the  control  design 
problem  maybe  more  complicated  due  to  the  presence 


of  several  actuators  that  are  available  in  the  implemen¬ 
tation  of  the  control  signal. 

A  key  issue  is  what  happens  to  learning  during  satura¬ 
tion.  It  is  expected  that  during  saturation  the  magni¬ 
tude  of  the  tracking  error  will  increase,  since  the  control 
signal  is  not  being  achieved.  This  tracking  error  is  not 
the  result  of  function  approximation  error,  therefore  we 
need  to  be  careful  so  that  the  approximator  does  not 
cause  “unlearning”  during  the  period  when  the  actua¬ 
tors  are  saturated. 

How  do  we  modify  the  adaptive  laws  (8),  (9)  for  up¬ 
dating  Of,  9g  during  saturation?  Clearly,  the  adap¬ 
tive  laws  (8),  (9)  depend  on  the  tracking  error  x  —  xr, 
therefore  if  the  tracking  error  increases  due  to  satura¬ 
tion,  the  adaptive  law  may  cause  a  significant  change 
in  the  weights  in  response  to  the  increase  in  tracking 
error  (the  tracking  error  is  driving  the  adaptation).  A 
very  simple  solution  may  be  to  turn  off  adaptation  in 
the  event  of  saturation.  However,  this  is  not  necessar¬ 
ily  the  best  solution.  Next,  we  develop  two  modified 
learning  control  schemes  to  address  the  input  satura¬ 
tion  problem. 

3  Direct  Learning  Control  Scheme 

The  first  approach  is  based  on  a  direct  learning  control 
framework.  According  to  this  approach,  the  standard 
feedback  learning  control  law  (6)-(9)  is  modified  as  fol¬ 


lows: 

u  =  saX(uQ,UL,uu)  (10) 

u0  =— “XT  ~fo(x)  ~  f(x>9f)  ~  ax  +  fir  +  iill) 
9\x,9g)  l  J 

0/  =  Tf(x-xr  ~x)<t>f{x)  (12) 

0g  =  T g(x  -  xr  -  x)(t>g{x)u  (13) 

X  =  ~ax  +  ff(xJg)  (u-  u0)  (14) 


The  above  modified  control  law  involves  the  utilization 
of  the  x  signal,  which  is  a  filtered  version  of  the  effect 
of  input  saturation  on  the  variable  being  controlled. 
Note  that  in  the  case  of  no  input  saturation,  then  x 
remains  zero  and  the  control  law  becomes  the  same 
as  the  standard  learning  control  law  described  in  the 
previous  section.  In  the  presence  of  input  saturation, 
X  is  non-zero,  thus  giving  rise  to  a  modified  tracking 
error  x  =  x  —  xr  —  X-  The  signal  rj  in  the  specification 
of  uq  in  equation  (11)  will  be  used  later  on  to  deal  with 
the  presence  of  functional  approximation  error. 

Next,  we  proceed  to  investigate  the  stability  proper¬ 
ties  of  the  closed-loop  system,  based  on  the  learning 
control  law  described  by  (10)-(14).  Consider  the  Lya¬ 
punov  function  V  =  \x 2  +  ^0jTJ10f  +  ^6jTg1Qg, 
where  9Z  =  9Z  —  9*  and  2  G  {/,  g}.  The  “optimal” 
parameter  vector  9j  is  defined  as  the  weight  vector  of 
the  on-line  approximator  that  minimizes  the  functional 


approximator  error  between  f(x)  and  f(x,9f)  over  a 
compact  (i.e.,  closed  and  bounded)  operating  envelope 

V  (correspondingly  for  g(x)): 

6*  =  argmin  (  sup  | z(x)  -  z(x,  9Z)\  )  ,  2  G  {/,  g} 

e~  \xGV  J 

The  minimum  possible  distance  between  f{x)  and 
f(x,9f)  over  V  is  referred  to  as  minimum  functional 
approximation  error  (MFAE)  over  T>  and  is  denoted  by 
ez{x,V)  =  z(x)  -  z(x,9z)\0z=0,  for  2  G  {/,  g}.  The 
bound  on  ez  is  denoted  by  Ez  =  sup^g-p  \ez{x,  V) |.  It  is 
noted  that  ez,  9*^,  9*  are  required  only  for  the  theoret¬ 
ical  analysis,  not  for  the  design  of  the  learning  control 
law.  On  the  other  hand,  the  bound  Ez  is  required  also 
in  the  design  in  order  to  robustify  the  learning  scheme 
with  respect  to  the  possible  presence  of  MFAEs.  The 
requirement  of  assuming  a  known  bound  on  the  MFAE 
can  be  relaxed  by  using  adaptive  bounding  techniques 
[13],  where  the  bound  is  estimated  on-line  using  adap¬ 
tive  methods. 

By  taking  the  time  derivative  of  V  along  the  solutions 
of  (1),  (2),  (12)-(14)  and  taking  into  consideration  the 
implemented  control  law  (10),  (11),  we  obtain 

V  =  x  (/o  +  /  +  gu  +  axr  -  (3r  +  ax~  g(u  -  u0)) 

+  (9j  (j)f  +  9j  (j)gUj 

=  —ax2  +  x  (ef  +  egu  +  rf) 

<  —  ax2  +  \x\E  +  xg  (15) 

where  £  is  a  bound  on  the  combined  approximation 
error  £/  +  £gu  over  the  operating  envelope  V. 

In  the  ideal  case  of  zero  approximation  error,  77  can  be 
set  to  zero,  thus  yielding  V  =  —  ax2.  If  the  approxima¬ 
tion  error  is  non-zero,  but  an  upper  bound  E  is  available 
(known) ,  then  the  robustifying  term  77  could  be  chosen 
as  77  =  —k  sgn(x),  where  sgn  denotes  the  sign  function 
and  k  >  0  is  a  constant  gain  function  satisfying  k  >  E. 
In  this  case,  |a;|£  +  a;?7  =  —(k  —  E) \x\  <  0.  Alternatively, 
77  could  be  chosen  as  77  =  —koq(x),  where  ko  >  0  is  a 
positive  constant  and  q  is  an  odd  function  of  x.  In  this 
case,  it  can  be  readily  shown  that  \x\E  +  xr)  <  0  for 
all  x  outside  the  region  Q  =  {x  |  sgn(x)gr(a;)  >  E/ko}, 
which  can  be  made  arbitrarily  small  by  increasing  the 
value  of  the  design  constant  ko-  This  second  approach 
has  the  advantage  of  avoiding  the  use  of  the  discontin¬ 
uous  sign  function,  which  can  cause  chattering  in  the 
feedback  loop. 

Since  the  time  derivative  V  is  negative  semi-definite  and 

V  =  —  x2,  we  obtain  that  x,  9f ,  9g  remain  bounded 
and,  using  Barbalat’s  Lemma,  x(t)  converges  to  zero 
as  t  goes  to  00.  It  is  important  to  note  that  these 
results  are  based  on  the  assumption  that  9j  <j>(x)  does 
not  approach  zero.  In  order  to  achieve  that,  some  type 
of  projection  modification  [15]  is  required  in  the  update 


of  0g(t).  This  required  modification  is  independent  of 
the  saturation  issue  since  it  would  have  been  needed 
even  in  the  absence  of  any  saturation  constraints. 

4  Indirect  Learning  Control  Scheme 

In  the  previous  section  we  considered  a  direct  type  of 
learning  scheme,  in  the  sense  that  the  tracking  error  is 
driving  the  adaptation.  Another  approach  is  to  con¬ 
sider  an  indirect  learning  control  scheme  where  the 
adaptation  (learning)  is  driven  by  the  identification  er¬ 
ror  instead  of  the  tracking  error.  Intuitively,  it  appears 
that  such  a  scheme  may  be  more  robust  to  actuator 
saturation. 


Based  on  the  simple  system  (1),  consider  the  identifi¬ 
cation  model 

x  =  - ax  +  ax  +  fo(x)  +  f(x,  Of)  +  g(  x,  0g)u  —  g,  (16) 

where  g  is  to  be  designed  to  address  the  robustness 
issue  in  the  presence  of  approximation  errors.  Let  the 
identification  error  be  defined  as  ej(t)  =  x(t)  —  x(t). 
From  (1),  (16)  we  obtain 

e/  =  -ae/+(/( x)  -  f(x,0f)^  +  (g(x)  -  g(x,Og)^j  u+g. 

(17) 

We  proceed  to  design  a  feedback  control  law  which  in¬ 
corporates  input  saturation,  similar  to  the  scheme  de¬ 
veloped  in  Section  3,  and  then  we  will  analyze  the  sta¬ 
bility  properties  of  the  indirect  learning  control  scheme. 
The  overall  indirect  scheme  is  described  by 


u  =  sa,t(uo,UL,ujj) 
1 


(18) 


M0  = 


g(x,0g) 

Of  =  Tf{x-x)<j)f{x) 

Og  =  Tg(x  -  X)(j>g(x)U 


~fo(x)  -  f(x,  Of)  -  ax  +  Pr  +  #9) 


(20) 

(21) 


x  =  ax  —  ax  +  fo{x)  +  f(x,0f)  +  g(x ,  0g)u  —  i{ |22) 


In  the  above  indirect  scheme,  we  note  that  the  adap¬ 
tation  is  driven  by  the  identification  error  e/(f)  in¬ 
stead  of  the  modified  tracking  error  used  in  the  di¬ 
rect  scheme.  Furthermore,  the  x-filter  of  the  direct 
scheme  has  been  replaced  by  the  identification  esti¬ 
mator  (22).  To  investigate  the  stability  properties 
if  this  scheme  consider  the  Lyapunov  function  V/  = 
+  ^0jTJ18f  +  ^OjTg1^.  The  time  derivative  of 
Vi  along  the  solutions  of  (17),  (20),  (21)  satisfies 

V  =  -aej  +  ei  (ef  +  £gu  +  g) 

<  -aej  +  \ei\s  +  eig  (23) 


To  obtain  the  desired  stability  properties  g  can  be  cho¬ 
sen  as  g  =  koq(ei),  where  fco  >  0  and  q  is  an  odd 
function.  Therefore,  V  <  —aej,  which  implies  that  the 
parameter  estimates  remain  bounded  and  the  identifi¬ 
cation  error  e/(f)  converges  to  zero.  Note  that,  since 


this  is  an  indirect  scheme,  the  results  are  obtained  in¬ 
dependent  of  whether  saturation  is  reached  or  not. 


Now  lets  investigate  what  happens  to  the  tracking  error 
x  =  x  —  xr.  By  combining  (1),  (2)  and  using  the  control 
law  (18),  (19)  we  obtain 

x  =  -ax  +  (/( x)  -  f(x,  0f)J  +  (g(x)  -  g(x,  0g ))  u 
+  r)  +  g(x,0g)(u-uo)  (24) 

Note  that  the  tracking  error  dynamics  for  x(t)  are  sim¬ 
ilar  to  the  identification  error  dynamics  e/(t),  with  the 
exception  of  the  last  term  g(x,0g)(u  —  uq).  In  fact,  it 
can  be  readily  shown  that  for  appropriately  chosen  ini¬ 
tial  conditions  x(0),  we  have  e/(<)  =  x(t)  —  x(f)>  where 
X(t)  is  generated  (same  as  previously)  by 


X  =  -ax  +  g{x,0g)(u-uo),  x(0)  =  0.  (25) 

To  verify  this  relationship,  let  z  =  e/  —  x(t)  +  xW- 
From  (17),  (24),  (25)  it  can  be  easily  shown  that  the 
^-dynamics  satisfy  z  =  —az.  Therefore  if  x(0)  =  av(0), 
then  z  will  be  zero,  which  implies  that  e/(t)  =  x(t)  — 
x(t).  Note  that  even  if  the  initial  conditions  are  not 
satisfied,  the  difference  between  e/(t)  and  x(t)  —  x(t) 
will  exponentially  decay  to  zero. 


Since,  as  shown  earlier,  e/(t)  converges  to  zero,  this 
implies  that  x(t)  —  x(t)  converges  to  zero,  which  is  the 
same  result  as  obtained  in  the  previous  section  using 
the  direct  learning  scheme  method.  In  conclusion,  the 
direct  and  indirect  learning  schemes  are  implemented 
differently  but  they  yield  the  same  stability  properties. 

5  Saturation  in  Backstepping 

In  this  section,  we  extend  the  previous  results  to  the 
case  of  a  backstepping  feedback  control  procedure. 
Specifically,  we  consider  the  backstepping  control  of  the 
(7,  a,  Q)  loops  of  an  aircraft  model  [17],  where  7  is  the 
flight  path  angle,  a  is  the  angle  of  attack  and  Q  is  the 
pitch  rate.  The  tracking  error  dynamic  equations  are 
described  by 


7 


a 

4 


gS  {CL  +  CLa  {ac  +  a)) 
mVt  cos  ((3) 
qS{CL  +  CLaa) 


T  f'y  7c 


mVt  cos(P) 
c5P  R  —  c6(P2  —  R2) 

+  C7  q  S  c  (Cm  +  uq)  -  Q 


+  Qc  +  Q  +  fa  — 


(26) 

dc(27) 

(28) 


where  jc,  ac,  Qc  are  the  commanded  flight  path  angle, 
commanded  angle  of  attack  and  commanded  pitch  rate 
respectively,  and  7,  a,  Q  are  the  corresponding  track¬ 
ing  errors.  For  exact  definition  of  the  variables  and 
constants  in  the  aircraft  model  dynamics,  please  refer 
to  [17].  The  objective  in  this  section  is  to  incorporate 
the  modified  learning  control  scheme  of  Section  3  for 


dealing  with  saturation  of  the  actuator  signals  and  in¬ 
vestigate  its  effect  on  the  learning  algorithms. 

The  functions  Cl,  Cl a  and  Cm  are  unknown  and 
are  approximated  on-line  by  the  following  linearly- 
parameterized  approximators:  Cl  =  <j>(x)',  Cxa  = 

®cL  Cm  =  OlM<j>{X),  where  x  is  the  vehicle  state 

vector  that  contains  the  most  dominant  elements. 


Since  the  unknown  functions  Cl  and  Cl a  also  appear 
in  the  a  dynamics,  we  delay  the  selection  of  the  up¬ 
date  laws  for  9cl  and  0cL  until  the  next  step  of  the 
backstepping  procedure. 

Next  we  proceed  to  the  a-dynamics.  Let  a(t)  =  a(t)  — 
Xa(t)  be  the  modified  tracking  error  of  the  a  dynamics, 
where  Xaif)  is  generated  by 


Let  7(f)  =  7 (t)  —  X7(f)  be  the  modified  tracking  error 
of  the  7  dynamics,  where  X7(t)  is  generated  by 

— £1 

X  =  —  fc7x7  H - — - t^CLo,  (ac  -  a°c)  . 

mVt  cos(p) 

where  ac(t)  =  sat(a°,  ,  a^),  and  a a are  the 
lower  and  upper  limits,  respectively,  of  the  control  sig¬ 
nal  ac.  The  dynamics  of  the  modified  tracking  error 
7  =  7  -  Xj  satisfy 

=  men-*  K+a))  + 

mVt  cos  (p) 

-  (n,eLa  (ac  -  a°c)  (29) 

mVt  cos (0)  v  ’ 

The  nominal  control  law  is  given  by 


T  f'y  7c  T  ^"7  X';.' 


-CLa  (ac  -  a°) 


= 


(-/7  +  7c  -  fc7  7) 


For  simplicity,  in  this  section  we  do  not  consider  the  ro¬ 
bustness  issue  with  respect  to  residual  approximation 
errors,  however,  it  can  be  directly  addressed  using  the 
term  as  in  Section  3.  Specifically,  we  assume  that  the 
residual  approximation  error  is  zero  and  the  robustify- 
ing  term  v1  =  0. 

Note  that  the  term  Xa,  which  appears  in  the  control 
law  (30)  is  the  saturation  filter  for  the  a-dynamics,  and 
it  will  appear  in  the  next  step  of  the  backstepping  pro¬ 
cedure.  By  substituting  (30)  in  (29)  we  obtain  after 
some  algebraic  manipulation 


(6cL(/>(x,y)  +  0cLa(/>(x,y)aJ  . 


The  derivative  of 


^  _  ,  _  ,  q  s  cLa{ot  -  Xa) 

7  ~  kj7+  mVt  cos(/3) 

~  m  v"L(I3) 

Define  the  7-Lyapunov  function  as  V7  = 

I  (#CLrcl  ter.  +  tilLa  Tcla  0cLa  )  •  The  deriva 
V7  along  solutions  of  the  7  dynamic  equation  is 

v7  =  + 

mVt  cos(p) 

+  (ticL  TCl  m  ytScom<t> 7 

,  AT  ri-1  (  A  Q  S 


V7  =  — A772  +  7- 


qT  p-1 
iCLLCL 


0Cl  -  TCl 


7GLa  1  CLa 


0cL„  -  TCl 


m  Vt  cos  (/3) 


Xa  kaXa  T  (Q  c  Qc)  • 

where  Qc  =  sat(Q°,  Q Q(!)  and  Q%,  are  the  lower 
and  upper  limits,  respectively,  of  the  control  signal  Qc. 

The  dynamics  of  the  modified  tracking  error  a  =  a— Xa 
satisfy 

,  _  qS{CL  +  CLaa)  ,  ^  r  .  ,  , 

a  ~ - 77 - T^. - 1"  Qc  +  Q  +  Ja  ~  OLc  +  KaXa 

mVt  cos(p) 

-  (Qc  -  Q°c )  (31) 

The  nominal  (non-saturated)  control  law  for  the  a- 
dynamics  is  given  by 


Qc  = 


qS  (dL  +  CL,  a) 
mVt  cos(/3) 


kaQL  fa  T  dc 


-  qSCLa 

-  Tn  V,  cos(/3)  -  W  (32) 

In  practice,  dc  is  not  available  for  measurement  and 
hence  is  approximated  by  a  signal  of  the  form  bc  = 
T  7|-i  Q^c-  For  notational  simplicity  we  avoid  this  ap¬ 
proximation  in  the  current  analysis  and  assume  that 
dc  is  available.  By  substituting  (32)  in  (31),  the  mod¬ 
ified  tracking  error  dynamics  for  a  satisfy 


a  =  —kaa  +  Q  —  XQ 


q  S  CLa 
m  Vt  cos  (/3) 


Q  S 

to  Vt  cos(P) 


(( CL  ~  CL)  +  (cLa  -  CLa)  a ) 


Define  the  (7,  a)  Lyapunov  function  as  V(7jQ;)  =  V7  + 
|a2.  After  some  algebra,  it  can  be  shown  that  the 
derivative  of  V(7)0)  along  solutions  of  the  (7,  a)  dy¬ 
namic  equations  is 


F(7,a)  (fc77  T  kaa  )  T  (Q  Xa) 

,  AT  r,-l  (a  Q  S 


+  %LjcL 


0cL„  ~TCl 


q  S  a 

m  Vt  cos(/3) 


<j>a  (7  —  a) 


Based  on  the  above  Lyapunov  function  derivative,  we 
now  select  the  adaptive  laws  for  0cL  and  9cL  : 


ecL  =  F  Cl — (33) 
m  Vt  cos(p) 


=  rCL 


to  Vt  cos  ((3) 


4>a  (7  —  a)  (34) 


In  the  third  and  final  step  we  proceed  to  the  Q- 
dynamics.  Let  Q(t)  =  Q(t)  —  xq(<)  be  the  modified 
tracking  error  of  the  Q  dynamics,  where  Xg(t)  is  gen¬ 
erated  by 

XQ  =  ~kQXQ  +  C7  q  S  c(uQ-  u°Q )  , 

and  UQ(t)  =  sat (v.q,Uq,Uq).  The  dynamics  of  the 
modified  tracking  error  Q  =  Q  —  \Q  satisfy 

Q  =  c5P  R- c6(P'2  -  R2)  +  c7  q  S  c(CM  +  uQ) 
-Qc  +  kQXQ  -c7  q  S  c  ( UQ  -  Uq)  (35) 

The  nominal  (non-saturated)  control  law  for  the  Q- 
dynamics  is  given  by 

o  p  ~^qQ  -  d  +  Qc—  ( c5P  R  -  c6(P2  -  -R2)) 

UQ  =  ~Cm+ - - • 

(36) 

By  substituting  (36)  into  (35)  the  modified  tracking 
error  dynamics  for  Q  satisfy 

Q  =  -kQQ  -  a  +  c7  q  S  c  (cM  -  CM^j  ■ 

Define  the  (7,  a,  Q)  Lyapunov  function  as 

,a,Q)  =  v(7,a)  +  2 Q2  +  2  (^Mrc^c«)  • 

The  derivative  of  V(7]Q>q)  along  solutions  of  the 
(7,  a,  Q)  dynamic  equations  and  update  laws  for  the 
weights  of  the  on-line  approximators  is 

V(7 ,a,Q)  =  ~  (fc772  +  kaa2  +  kcjQ~) 

+  -  r cMC7  q  S  c^q) 

Therefore,  we  select  the  adaptive  law  for  9c M  as  fol¬ 
lows: 

0cM  =  rCMc 7  Q  S  C(j)Q. 

If  (fc7,fcQ,fcg)  are  all  positive  then  V(7jCtiQ)  is  nega¬ 
tive  semi-definite.  Therefore,  using  Barbalat’s  Lemma 
the  modified  tracking  errors  7,  a,  Q  converge  to  zero 
asymptotically. 

6  Conclusions 

This  article  has  presented  an  approach  for  dealing  with 
control  input  saturation  in  uncertain  nonlinear  system, 
where  the  uncertainty  is  approximated  on-line  with 
learning  techniques.  A  key  issue  has  been  the  modifica¬ 
tion  of  the  update  laws  associated  with  the  weights  of 
the  on-line  approximator,  such  that  the  presence  of  sat¬ 
uration  limits  do  not  cause  instability  in  the  closed-loop 
system  nor  “unlearning”  in  the  on-line  approximation 
process.  The  presented  design  and  analysis  procedure 
first  considered  the  case  of  a  first-order  plant,  and  later 
was  extended  to  a  backstepping  control  scheme  based 
on  an  aircraft  vehicle  model. 
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